10 research outputs found
Group Affect Prediction Using Multimodal Distributions
We describe our approach towards building an efficient predictive model to
detect emotions for a group of people in an image. We have proposed that
training a Convolutional Neural Network (CNN) model on the emotion heatmaps
extracted from the image, outperforms a CNN model trained entirely on the raw
images. The comparison of the models have been done on a recently published
dataset of Emotion Recognition in the Wild (EmotiW) challenge, 2017. The
proposed method achieved validation accuracy of 55.23% which is 2.44% above the
baseline accuracy, provided by the EmotiW organizers.Comment: This research paper has been accepted at Workshop on Computer Vision
for Active and Assisted Living, WACV 201
Conversational Machine Comprehension: a Literature Review
Conversational Machine Comprehension (CMC), a research track in
conversational AI, expects the machine to understand an open-domain natural
language text and thereafter engage in a multi-turn conversation to answer
questions related to the text. While most of the research in Machine Reading
Comprehension (MRC) revolves around single-turn question answering (QA),
multi-turn CMC has recently gained prominence, thanks to the advancement in
natural language understanding via neural language models such as BERT and the
introduction of large-scale conversational datasets such as CoQA and QuAC. The
rise in interest has, however, led to a flurry of concurrent publications, each
with a different yet structurally similar modeling approach and an inconsistent
view of the surrounding literature. With the volume of model submissions to
conversational datasets increasing every year, there exists a need to
consolidate the scattered knowledge in this domain to streamline future
research. This literature review attempts at providing a holistic overview of
CMC with an emphasis on the common trends across recently published models,
specifically in their approach to tackling conversational history. The review
synthesizes a generic framework for CMC models while highlighting the
differences in recent approaches and intends to serve as a compendium of CMC
for future researchers.Comment: Accepted to COLING 202
Relation Classification for Bleeding Events From Electronic Health Records Using Deep Learning Systems: An Empirical Study
BACKGROUND: Accurate detection of bleeding events from electronic health records (EHRs) is crucial for identifying and characterizing different common and serious medical problems. To extract such information from EHRs, it is essential to identify the relations between bleeding events and related clinical entities (eg, bleeding anatomic sites and lab tests). With the advent of natural language processing (NLP) and deep learning (DL)-based techniques, many studies have focused on their applicability for various clinical applications. However, no prior work has utilized DL to extract relations between bleeding events and relevant entities.
OBJECTIVE: In this study, we aimed to evaluate multiple DL systems on a novel EHR data set for bleeding event-related relation classification.
METHODS: We first expert annotated a new data set of 1046 deidentified EHR notes for bleeding events and their attributes. On this data set, we evaluated three state-of-the-art DL architectures for the bleeding event relation classification task, namely, convolutional neural network (CNN), attention-guided graph convolutional network (AGGCN), and Bidirectional Encoder Representations from Transformers (BERT). We used three BERT-based models, namely, BERT pretrained on biomedical data (BioBERT), BioBERT pretrained on clinical text (Bio+Clinical BERT), and BioBERT pretrained on EHR notes (EhrBERT).
RESULTS: Our experiments showed that the BERT-based models significantly outperformed the CNN and AGGCN models. Specifically, BioBERT achieved a macro F1 score of 0.842, outperforming both the AGGCN (macro F1 score, 0.828) and CNN models (macro F1 score, 0.763) by 1.4% (P \u3c .001) and 7.9% (P \u3c .001), respectively.
CONCLUSIONS: In this comprehensive study, we explored and compared different DL systems to classify relations between bleeding events and other medical concepts. On our corpus, BERT-based models outperformed other DL models for identifying the relations of bleeding-related entities. In addition to pretrained contextualized word representation, BERT-based models benefited from the use of target entity representation over traditional sequence representation
Fine-Tuning Bidirectional Encoder Representations From Transformers (BERT)-Based Models on Large-Scale Electronic Health Record Notes: An Empirical Study
BACKGROUND: The bidirectional encoder representations from transformers (BERT) model has achieved great success in many natural language processing (NLP) tasks, such as named entity recognition and question answering. However, little prior work has explored this model to be used for an important task in the biomedical and clinical domains, namely entity normalization.
OBJECTIVE: We aim to investigate the effectiveness of BERT-based models for biomedical or clinical entity normalization. In addition, our second objective is to investigate whether the domains of training data influence the performances of BERT-based models as well as the degree of influence.
METHODS: Our data was comprised of 1.5 million unlabeled electronic health record (EHR) notes. We first fine-tuned BioBERT on this large collection of unlabeled EHR notes. This generated our BERT-based model trained using 1.5 million electronic health record notes (EhrBERT). We then further fine-tuned EhrBERT, BioBERT, and BERT on three annotated corpora for biomedical and clinical entity normalization: the Medication, Indication, and Adverse Drug Events (MADE) 1.0 corpus, the National Center for Biotechnology Information (NCBI) disease corpus, and the Chemical-Disease Relations (CDR) corpus. We compared our models with two state-of-the-art normalization systems, namely MetaMap and disease name normalization (DNorm).
RESULTS: EhrBERT achieved 40.95% F1 in the MADE 1.0 corpus for mapping named entities to the Medical Dictionary for Regulatory Activities and the Systematized Nomenclature of Medicine-Clinical Terms (SNOMED-CT), which have about 380,000 terms. In this corpus, EhrBERT outperformed MetaMap by 2.36% in F1. For the NCBI disease corpus and CDR corpus, EhrBERT also outperformed DNorm by improving the F1 scores from 88.37% and 89.92% to 90.35% and 93.82%, respectively. Compared with BioBERT and BERT, EhrBERT outperformed them on the MADE 1.0 corpus and the CDR corpus.
CONCLUSIONS: Our work shows that BERT-based models have achieved state-of-the-art performance for biomedical and clinical entity normalization. BERT-based models can be readily fine-tuned to normalize any kind of named entities
Entity-Enriched Neural Models for Clinical Question Answering
We explore state-of-the-art neural models for question answering on
electronic medical records and improve their ability to generalize better on
previously unseen (paraphrased) questions at test time. We enable this by
learning to predict logical forms as an auxiliary task along with the main task
of answer span detection. The predicted logical forms also serve as a rationale
for the answer. Further, we also incorporate medical entity information in
these models via the ERNIE architecture. We train our models on the large-scale
emrQA dataset and observe that our multi-task entity-enriched models generalize
to paraphrased questions ~5% better than the baseline BERT model
Recommended from our members
Evidence Assisted Learning for Clinical Decision Support Systems
Clinical decision support systems (CDSS) provide intelligently filtered knowledge and patient-specific and population information to the clinicians, nursing staff and healthcare professionals. CDSS can significantly improve the quality, safety, efficiency and effectiveness of health care. Over the last decade, American hospitals have adopted electronic health records (EHRs) widely resulting in a massive collection of clinical notes such as admission notes, physician notes, nursing notes and discharge summaries. For the past couple of decades, most of the work in CDSS has been focused on developing knowledge-based systems using structured data such as medications and ICD codes. In contrast, the EHR notes incorporate rich and important information including adverse drug events, suicidal behaviors, and social determinants of health, all of which are substantially under-represented in the structured data. This presents a unique opportunity for natural language processing (NLP), with its ability to process a massive amount of EHR notes beyond the scope of human capability, to provide new clinical evidence previously missed out by any CDSS systems.
We contribute to the NLP and clinical community by developing a robust multi-task learning framework for CDSS. First, we identified causality between medication and its adverse drug reactions using a clinically standardized assessment technique called Naranjo Scale. Our multi-task learning framework takes a question from Naranjo Scale, along with a patient\u27s note to identify relevant evidence sentences and paragraphs in the note and predicts the final answer for the question. Second, we extracted suicide attempt (SA) and suicide ideation (SI) events from patients\u27 clinical notes. We created the first publicly available suicide attempt and ideation events (ScAN) dataset. We then built a multi-task learning model ScANER (Suicide Attempt and Ideation Attempts Retriever) to extract the relevant suicidal behavior evidence from clinical notes. Next, we deployed multiple parameter-efficient transfer learning techniques to fine-tune the ScANER model for different hospitals’ EHR datasets. By fine-tuning less than ~2% of ScANER’s parameters on a small annotated data, ScANER is able to maintain a similar performance. To provide evidence for population-level CDSS for suicide prevention, we identified risk factors for suicidal behaviors using large EHRs (~7 million patients). We found that patients with traumatic brain injury and/or post-traumatic stress disorder are more than twice as likely to have suicidal behavior as compared to the control population. We also studied the prevalence of other risk factors, such as social determinants of health, extracted from EHR notes using different NLP approaches
Inferring ADR causality by predicting the Naranjo Score from Clinical Notes
Clinical judgment studies are an integral part of drug safety surveillance and pharmacovigilance frameworks. They help quantify the causal relationship between medication and its adverse drug reactions (ADRs). To conduct such studies, physicians need to review patients\u27 charts manually to answer Naranjo questionnaire(1). In this paper, we propose a methodology to automatically infer causal relations from patients\u27 discharge summaries by combining the capabilities of deep learning and statistical learning models. We use Bidirectional Encoder Representations from Transformers (BERT)(2) to extract relevant paragraphs for each Naranjo question and then use a statistical learning model such as logistic regression to predict the Naranjo score and the causal relation between the medication and an ADR. Our methodology achieves a macro-averaged f1-score of 0.50 and weighted f1-score of 0.63
Recommended from our members
Relation Classification for Bleeding Events From Electronic Health Records Using Deep Learning Systems: An Empirical Study
Background: Accurate detection of bleeding events from electronic health records (EHRs) is crucial for identifying and characterizing different common and serious medical problems. To extract such information from EHRs, it is essential to identify the relations between bleeding events and related clinical entities (eg, bleeding anatomic sites and lab tests). With the advent of natural language processing (NLP) and deep learning (DL)-based techniques, many studies have focused on their applicability for various clinical applications. However, no prior work has utilized DL to extract relations between bleeding events and relevant entities. Objective: In this study, we aimed to evaluate multiple DL systems on a novel EHR data set for bleeding event-related relation classification. Methods: We first expert annotated a new data set of 1046 deidentified EHR notes for bleeding events and their attributes. On this data set, we evaluated three state-of-the-art DL architectures for the bleeding event relation classification task, namely, convolutional neural network (CNN), attention-guided graph convolutional network (AGGCN), and Bidirectional Encoder Representations from Transformers (BERT). We used three BERT-based models, namely, BERT pretrained on biomedical data (BioBERT), BioBERT pretrained on clinical text (Bio+Clinical BERT), and BioBERT pretrained on EHR notes (EhrBERT). Results: Our experiments showed that the BERT-based models significantly outperformed the CNN and AGGCN models. Specifically, BioBERT achieved a macro F1 score of 0.842, outperforming both the AGGCN (macro F1 score, 0.828) and CNN models (macro F1 score, 0.763) by 1.4% (P\u3c.001) and 7.9% (P\u3c.001), respectively. Conclusions: In this comprehensive study, we explored and compared different DL systems to classify relations between bleeding events and other medical concepts. On our corpus, BERT-based models outperformed other DL models for identifying the relations of bleeding-related entities. In addition to pretrained contextualized word representation, BERT-based models benefited from the use of target entity representation over traditional sequence representatio